Sensors 2014, 14, 9451-9470; doi:10.3390/sl40609451 



OPEN ACCESS 



sensors 

ISSN 1424-8220 

www.mdpi.com/journal/sensors 

Article 

Sparse Representation for Infrared Dim Target Detection via a 
Discriminative Over-Complete Dictionary Learned Online 

Zheng-Zhou Li Jing Chen \ Qian Hou \ Hong-Xia Fu \ Zhen Dai \ Gang Jin ^ 
Ru-Zhang Li ^ and Chang- Ju Liu ^ 

^ College of Communication Engineering, Chongqing University, Chongqing 400044, China; 
E-Mails: chenjingsunshing@gmail.com (J.C.); shuijingxin@gmail.com (Q.H.); 
fhxma092@gmail.com (H.-X.F.); daizhenlite@gmail.com (Z.D.) 

National Laboratory of Analogue Integrated Circuits, No. 24 Research Institute of China 

Electronics Technology Group Corporation, Chongqing 400060, China; 

E-Mails: lrzmail@163.com (R.-Z.L.); liucj@cetccq.com.cn (C.-J.L.) 

China Aerodynamics Research and Development Center, Mianyang 621000, China; 

E-Mail: gjin@ioe.ac.cn 

* Author to whom correspondence should be addressed; E-Mail: lizhengzhou@cqu.edu.cn; 
Tel.: -^86- 132-0601-57 17; Fax: -^86-023-65 10-3544. 

Received: 7 February 2014; in revised form: 8 May 2014 / Accepted: 14 May 2014 / 
Published: 27 May 2014 

Abstract: It is difficult for structural over-complete dictionaries such as the Gabor function 
and discriminative over-complete dictionary, which are learned offline and classified 
manually, to represent natural images with the goal of ideal sparseness and to enhance the 
difference between background clutter and target signals. This paper proposes an infrared 
dim target detection approach based on sparse representation on a discriminative 
over-complete dictionary. An adaptive morphological over-complete dictionary is trained 
and constructed online according to the content of infrared image by K- singular value 
decomposition (K-SVD) algorithm. Then the adaptive morphological over-complete 
dictionary is divided automatically into a target over-complete dictionary describing target 
signals, and a background over-complete dictionary embedding background by the criteria 
that the atoms in the target over-complete dictionary could be decomposed more sparsely 
based on a Gaussian over-complete dictionary than the one in the background 
over-complete dictionary. This discriminative over-complete dictionary can not only 
capture significant features of background clutter and dim targets better than a structural 
over-complete dictionary, but also strengthens the sparse feature difference between 
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background and target more efficiently than a discriminative over-complete dictionary 
learned offline and classified manually. The target and background clutter can be sparsely 
decomposed over their corresponding over-complete dictionaries, yet couldn't be sparsely 
decomposed based on their opposite over-complete dictionary, so their residuals after 
reconstruction by the prescribed number of target and background atoms differ very 
visibly. Some experiments are included and the results show that this proposed approach 
could not only improve the sparsity more efficiently, but also enhance the performance of 
small target detection more effectively. 

Keywords: dim target detection; adaptive morphological over-compete dictionary; 
discriminative over-complete dictionary; Gaussian over-complete dictionary; sparsity 



1. Introduction 

The distance between man-made satellites and ground-based EO sensors is usually more than 
30,000 km, and the angle between the object and ground-based EO sensor is so small that the target on 
a EO sensor is a small blob with only several pixels. Meanwhile, the energy of the object decays 
greatly for long distance propagation, and it is usually submerged in noise and clutter. This causes 
great difficulty for infrared dim small target detection and tracking [1,2]. The problem of how to 
effectively distinguish dim small targets from clutter has been widely studied over the past years, and a 
number of dim target detection algorithms have been developed and they can approximately be classified 
into two categories, namely, detection before track (DBT) and track before detection (TBD) [3-5]. Image 
filtering and content learning are the two basic methods of DBT-based target detection algorithms. The 
image filtering-based detection algorithms such as Top-Hat [6], TDLMS [7] and wavelet transform [8,9] 
usually whiten the image signal, and then determine whether there is a target or not in every scan by the 
amplitude threshold using some criteria, such as constant false alarm ratio (CFAR). The content 
learning-based target detection algorithms such as principal component analysis (PCA) compare the 
similarity of the image and the template pre-leamed by knowledge [10]. The TBD-based algorithms 
jointly process more consecutive scans and declare the presence of a target and its track by searching 
the candidate trajectory using an exhaustive hypothesis. Temporal cross product (TCP) is presented to 
extract the characteristics of temporal pixels by using a temporal profile in infrared image sequences, 
and it could effectively enhance the signal-to-clutter ratio (SCR) [11]. Higher detection probability and 
lower false alarm probability in every scan could not only facilitate analysis of characteristics, 
including movement analysis, but also simplify the computational complexity for TBD. 

The sparse representation decomposed on an over-complete dictionary is a newly-developed 
content learning-based target detection algorithm strategy [12-14]. In the over-complete dictionary, 
there are large number of atoms representing target and even background [15]. Gaussian [16,17] and 
Gabor [18] are the representative functions used to construct structural over-complete dictionaries 
offline. It is difficult for these structural over-complete dictionaries to suit the target and background 
with non-structure shape, so their representation coefficient vectors would be not sparse enough 
to distinguish targets from background clutter [19]. The adaptive morphological component 
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over-complete dictionary is constructed according to the image content, and it could enhance the 
sparsity of the representation coefficient vector. However, the atoms representing target and 
background are mixed together, and the sparsity of the representation coefficient vector is usually too 
lower to detect target signals [20], therefore, it is necessary to discriminate the atoms representing 
targets from the ones describing background clutter. The existing techniques to discriminate atoms 
usually choose background clutter to train a background over-complete dictionary, and manually select 
target signals to build a target over-complete dictionary. The discriminative over-complete dictionary 
trained manually could greatly improve the sparsity of the representation coefficient vector and also 
improve the performance of dim target detection. Nevertheless, its serious limitation is that the atoms 
couldn't adapt to moving targets and changing backgrounds effectively, so it is necessary to 
distinguish the atoms automatically for the discriminative over-complete dictionary in order to further 
enhance the capability of dim target detection. 

An infrared dim target detection approach based on sparse representation over discriminative 
over-complete dictionary learned online is proposed in this paper. An adaptive morphological 
over-complete dictionary is built according to infrared image content by a K- singular value 
decomposition (K-SVD) algorithm [21], and then a target over-complete dictionary is discriminated 
automatically from a background over-complete dictionary by the criteria that the atoms representing 
dim target signals could be decomposed more sparsely over a Gaussian over-complete dictionary than 
the one in a background over-complete dictionary. The remainder of the paper is organized as follows: 
the adaptive over-complete dictionary is trained in Section 2. It is further divided into target 
over-complete dictionary and background over-complete dictionary in Section 3. The sparsity-driven 
dim target detection approach is presented in Section 4. Some experiments are included in Section 5 to 
evaluate the performance of the discriminative over-complete dictionary and Gaussian over-complete 
dictionary, and the results show that the target detection performance achievable by the proposed 
approach is significantly enhanced. Conclusions are drawn in Section 6. 

2. Adaptive Morphological Component Dictionary 

Infrared dim target images consists of target, background and noise, and can be modeled as [5]: 

/ = /,+/;,+« target present ( 1 ) 

f - fb^^ target absent 

where / is the infrared image, ft ,fb, and n are the target, background and noise, respectively. Dim 

target detection is a two-category classification problem, i.e., pixels are labeled as target (target 
present) or background (target absent) based on their diverse characteristic differences. The dim target 
concentrates itself relatively in a small other than pixel-sized object region with uniform amplitude, 
and could be described by point spread function: 



x,y,k =a k •exp< 



y-y, k 

5y k 



(2) 
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where, a k is target intensity amplitude; denotes the target location at instant k, and and 

y, represent the horizontal and vertical direction, respectively; k and k are the extent 

parameters at horizontal and vertical direction, respectively, and they are usually several pixels for 
their angle is very small when the distance between the man-made satellite and ground-based EO 
receiver is 30,000 km. Moreover, the signal-to-noise (SNR) is always also low because the target's 
energy decays greatly when it propagates in noise and clutter. 

The K-singular value decomposition (K-SVD) algorithm is adopted to learn the image content and 
train adaptive over-complete dictionary D from a large number of infrared dim images. The dictionary 
is trained by the following formula [22] : 

r,D =argimn x;||r|L+E||Dr-/||2 (3) 



where \\.\\^ denotes - norm, which is defined as the number of nonzero entries in the vector, and \\.\\^ 
denotes - norm , which is defined as the energy of the vector. The formula means the infrared image/ 
could be decomposed on the over-complete dictionary D, and the representing coefficient vector 7 
has the least number of nonzero entries under the constraint that the residual ||/ -Dyfj is the minimum. 

Obviously, the sparsity of the representation coefficients would be dominated by the over-complete 
dictionary D. 

The two terms D and 7 should be solved simultaneously in Equation (3). The construction of the 
adaptive over-complete dictionary is an iterative process, and there are two stages in every iteration, 
namely, sparse coding and dictionary update [23]: 

(1) Sparse coding. In this stage, assuming that D is fixed; the sparse representation 7 is updated by 

solving the following formula: 

argmin||y||p 5.?. ||/-I>y||2<^ (4) 

It means that the sparsest representation vector 7 is searched by an orthogonal matching pursuit 
(OMP) algorithm under the constraint that the residual HZ-Dylj would be less than the error tolerance 
e [24,25]. 

(2) Dictionary D update. During this stage, only one atom in the dictionary D is updated at 
every iteration, and then the residual is estimated by: 



2 



(5) 



Therefore, an approximate solution set {dk,yk) would be optimized by the SVD algorithm. Repeating 



Equation (5), every atom dk in the dictionary D will be updated until the residual 



f-lLdk7k 



is less 



than the error tolerance e. The signal representation error decays exponentially with increasing 
iteration number, and the adaptive dictionary would be constructed after a few iterations. The final 
version dictionary D, which called adaptive morphological over-complete dictionary, would be 
compatible with the content of infrared image. 

In the morphological over-complete dictionary D, the atoms representing target and background are 
mixed together. This could induce two difficulties for the signal is decomposed based on not only the 
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target over-complete dictionary, but also the background over-complete dictionary. One is that the 
representing coefficients mightn't be sparse. The other is that the representing coefficients are irregular 
and couldn't easily discriminate target from background. To better illustrate the challenge, an example 
is shown in Figure 1. There is a target in Figure la. Its representation coefficient decomposed on the 
over-complete dictionary D is shown in Figure lb. The dictionary contains 1,144 atoms. There are 
many nonzero coefficients on atoms representing target and background, and the four atoms 
corresponding to the largest four nonzero entries are shown in Figure Ic. The first atom represents the 
target, and the other three atoms describe the background. Therefore, it is necessary to discriminate the 
atoms representing the target from the ones describing background clutter. 

Figure 1. Decomposition of a target image block on adaptive morphology over-complete 
dictionary, (a) Target image block, (b) Representation coefficient, (c) Four atoms with 
maximum coefficients. 



1 

03 




atom d 



(a) (b) (c) 

3. Discriminative Over- Complete Dictionary Constructed Online 

The sparse representation model assumes that the signal could be reconstructed with the same type 
of over-complete dictionary and corresponding representation coefficients [26]. For the background 
signal , it can be represented by a linear combination of the background atoms: 

/, ^a.^+M^+.-. + a^^^ (6) 

= [ , ^2 ,. . . , J* J [«, , ^2 , . . . , fl-^^ 'J = D^a 

where Db is the background over-complete dictionary, A^;, denotes the account of atoms in the 
dictionary Db, and a is the representing coefficient vector whose entries are the abundances of the 

corresponding atoms in Db, i.e., sparse vector or a vector with only few nonzero entries. 

Similarly, for the target signal /, , it can also be sparsely represented as a linear combination of the 

target atoms: 

/, «Ai^^i'+A^2+- + A,< (7) 
= [<,^/;,...,<][A,A,...,A^J'=d,p 



Sensors 2014, 14 



9456 



where Dt is the target over-complete dictionary, Nt is the number of atoms in the target dictionary Dt 
and P is the sparse representing coefficient vector. An infrared image lies in the union of the 
background dictionary and target dictionary, i.e.. 



/ = D,a + D,p= D, D, 



I>7 



(8) 



where D = [Db Dt] is a matrix consisting of both background and target atoms, and y = [a p] is a 
(Nt + Nb) - dimentional vector consisting of the two vectors a and p associated with the two 
dictionaries. 

Since background clutter and the target signal usually consist of different materials, they have 
distinct signatures and thus lie in different over-complete dictionaries. If an infrared image is a target 
signal, it ideally can be represented by a target dictionary, but can't be represented by the background 
atoms. In this case, a is a zero vector and p is a sparse vector. Therefore, the infrared image can be 
sparsely represented by the union of background over-complete dictionary and target over-complete 
dictionary, and the location of nonzero entries in the sparse vector y actually contains critical 
information about the class of the infrared image. 

The existing techniques to discriminate atoms usually choose background clutter to train a 
background over-complete dictionary, and manually select target signals to build a target 
over-complete dictionary offline. Nevertheless, the serious limitation of such an offline discriminative 
over-complete dictionary is that the atoms couldn't effectively adapt to moving targets and changing 
backgrounds, which would induce the discrimination between target and clutter to be too small to 
distinguish targets from background clutter, so it is necessary to distinguish automatically the atoms 
online for the discriminative over-complete dictionary in order to further enhance the capability of dim 
target detection. 

As the Introduction discusses, the distance between a geosynchronous satellite and a ground-based 
EO receiver is more than 30,000 km, and the angle is so small that the target on the EO sensor is a 
small blob with only several pixels with Gaussian distribution, so dim small targets usually are 
described by a two-dimensional Gaussian intensity model (GIM), which is widely used to describe 
infrared dim small targets: 

(9) 



where (iojo) is the the center of the blob, is the intensity at the position {ij), /max is the intensity of 
the peak, al and o-J are the extent parameters at horizontal and vertical directions, respectively. 

The extent parameters control the target intensity spread degree, and it represents a small point 

when they are very little, and denote a flat block when they are very large. In content learning-based 

target detection algorithms, the correlation between infrared image and GIM is usually measured to 

decide whether there is dim target or not. In this paper, the GIM-based structural over-complete 

dictionary is adopted to test the atoms of adaptive morphological component dictionary, and 

automatically discriminate the atoms representing target from the ones describing background clutter. 

There are four parameters in GIM, namely, center position (/oJo)> peak intensity /max, horizontal and 
vertical extent parameters al and <tS and they are adjusted to generate a large number of diverse 
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atoms with different position, brightness and shape. Figure 2 is a part of the atoms of Gaussian 
over-complete dictionary D . , and each atom has 7x7 pixels. 

Figure 2. Gaussian over-complete dictionary. 



Although a dim moving target is always polluted by environmental noise, it approximately affords a 
two-dimensional Gaussian model. Otherwise, different background clutter has diverse and abundant 
morphology. For example, cirrocumulus cloud is composed of small spherical clouds, which arrange in 
rows or groups; stratocumulus cloud is generally larger and looser, and its thickness and shape are also 
different. Figure 3 is the representation coefficients of target signal and background noise decomposed 
based on a Gaussian over-complete dictionary. The target signals and background noise are from a 
deep space image. Figure 3a and 3b are the target image block and its representation coefficients, 
respectively. The target signal is a bright blob with a black noise at its center, and it is very similar to a 
two-dimensional Gaussian function. There are only several nonzero representation coefficients, and the 
target signal could be decomposed sparsely by a Gaussian over-complete dictionary. Figure 3c and 3d 
are background noise and its representation coefficients, respectively. Much of its representation 
coefficients are nonzero, and the background noise should be reconstructed by many Gaussian atoms. 

Figure 3. Representation coefficient of a deep space image decomposed based on a 
Gaussian over-complete dictionary, (a) and (b) are target signal and its sparse coefficient, 
respectively; (c) and (d) are background noise and its sparse coefficient, respectively. 





(a) 



(b) 
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Figure 3. Cont. 
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A small infrared target in a cloud background and their representation coefficients decomposed 
based on a Gaussian over-complete dictionary are shown in Figure 4. Figure 4a and 4b are the target 
signal and its representation coefficients, respectively. The target signal is a bright rectangle, and it has 
more nonzero representation coefficients than that of the target signal similar to the two-dimensional 
Gaussian function shown in Figure 3. Therefore, a target signal like a bright rectangle could be 
reconstrcted by a few of Gaussian atoms with maximum nonzero representation coefficients. Figure 4c 
and 4d are the cloud background and its representation coefficients, respectively. It is shown that much of 
these coefficients are nonzero, and it couldn't be decomposed sparsely. Moreover, the representation 
coefficient difference between target signal and background is so small that it is hard to distinguish the 
target from background clutter. 

Figure 4. Representation coefficient of cloud image on Gaussian over-complete dictionary, 
(a) and (b) are target signal and it sparse coefficient, respectively, (c) and (d) are cloud 
background and its sparse coefficient, respectively. 
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Therefore, compared with the background atom in the adaptive morphological component 

dictionary, the target atom could be reconstructed from a lesser amount of Gaussian atoms. In other 

words, with the same number of Gaussian atoms with maximum sparse coefficients, the residual of a 

background atom would be much greater than that of a target atom. Based on this idea, the atom in the 

adaptive morphology over-complete dictionary D could be classified as target over-complete 

dictionary Dt and background over-complete dictionary Db. The atom dk is decomposed on Gaussian 
over-complete dictionary D as follows: 

1 J gaussum 

d = arg min \d, - D„„„„. all s.t. 

o \\ h gaussian ||2 



Ho=^ 



(10) 



where dk is an atom of D. After decomposing for the fixed k times, the residual r{dk) of the atom dk is 
defined as: 



r d,, = Wd,, -D .a 

k II « gaussian \\2 



(11) 



Whether the atom dk is target atom or not could be decided by comparing the residual r{dk) with a 
threshold d: 



jJ.eD, r d, <S 
4 r d, >5 



(12) 



The threshold 3 usually is proportional to the size of the atom. Every atom of D could be identified, 
then the adaptive background over-complete dictionary Db and target over-complete dictionary Dt 
could be constructed online and automatically. Figure 5 is an example of the discriminative 
over-complete dictionary learned online. Figure 5a is a part of adaptive morphological dictionary D for 
space target image. Figure 5b shows five target atoms. The target atoms have abundant shapes, and 
they can more really reflect the morphological characteristics of the original dim target than Gaussian 
atom. Five background atoms are listed in Figure 5c, and all of them are noise. 



Figure 5. Discriminative over-complete dictionary for space image, (a) Dictionary D, 
(b) Target atom of Dt, (c) Background atom of Db. 




....... . ■ ■ . . .-T**; . \ 



(b) 



(a) 



(c) 



Sensors 2014, 14 



9460 



4. Dim Target Detection Criteria 

Once the target and background over-complete dictionaries D; and D^, are learned and constructed 
online through these above procedures, the image / could be decomposed on these two dictionaries 
with the error tolerance o, respectively: 

P = argmin||p[ s.t. ||D,p-/||^<f7 (13) 

a = argmin||a||g s.t. ||D^a -/l^ < o" (14) 

These formulas are approximately solved by greedy pursuit algorithms such as orthogonal matching 

pursuit (OMP) or subspace pursuit (SP). The target can be sparsely decomposed over the target 

over-complete dictionary, yet it can't be sparsely decomposed on the background over-complete 

dictionary. For target signals, the residual reconstructed by target atoms with maximum representation 

coefficients would be less than that reconstructed by the same number of background atoms with 

maximum representation coefficients. Let us define the residual reconstructed by target over-complete 
dictionary as / : 



(15) 



where denotes the recovered sparse coefficient for / associated with the target over-complete 
dictionary, and m is a prescribed constant, such as five. Similarly, background clutter can be sparsely 
decomposed on the background over-complete dictionary, yet it can't be sparsely decomposed on the 
target over-complete dictionary. For background clutter, the residual reconstructed by target atoms 
with maximum representation coefficients would be larger than that reconstructed by the same number 
of background atoms with maximum representation coefficients. Let us define the residual 
reconstructed by background over-complete dictionary as f : 



(16) 



where a, denotes the recovered sparse coefficient for/ associated with the background over-complete 
dictionary. 

The sparsity-based dim target detector can be done by comparing the difference of residuals with a 
prescribed threshold, i.e., 

D f =r, f -r, f (17) 

If D(f) > ;/,/ would be labeled as target; otherwise, it would be labeled as background clutter. // is a 
prescribed threshold. 

5. Experimental Results and Analysis 

The following experiments have been implemented in MATLAB language on personal computer 
with a Pentium dual-core CPU E5900. Figure 6 shows the low contrast infrared images, which are 
captured outfield by an EO imaging tracking system. Figure 6a is the deep space sequence image. 
Figure 6b is the cloud sequence image, and Figure 6c is the multi-target image. Noise and cloud are the 
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background clutter of deep space images, multi-target images and cloud images, respectively. The 
target is the brighter maculous form at the center of rectangle box. Their signal-to-noises (SNRs) are 
about 2.3 and 3.5 in Figure 6a and 6b, respectively. In Figure 6c, three targets with different scale are 
marked, and their SNRs are different. 

Figure 6. Original infrared image, (a) Deep space image, (b) Cloud image, 
(c) Multi-target image. 



Figure 5, 7 and 8 are the morphological over-complete dictionary for the deep space images, cloud 
images and multi-target images, respectively. Compared with the Gaussian over-complete dictionary 
shown in Figure 2, the adaptive morphological over-complete dictionary has more diverse and 
abundant morphology, and it would be more suitable to represent original images with less atoms. 
Every atom is 7 x 7 pixels. 

The space image and cloud image are decomposed on a Gabor over-complete dictionary (GD), 
Gaussian model over-complete dictionary (GMD), and adaptive morphological over-complete 
dictionary (AMCD), and then they are reconstructed by the five atoms with maximum representation 
coefficients.^ The first two are structural dictionaries, and the last one is a non-structural dictionary. 
The residual energy between the original image and the reconstructed image is introduced to evaluate 
the capability of sparse representation. 

Obviously, the smaller the residual energy is, the more powerful the sparse representation would be. 
Ten target image blocks and ten background image blocks in space image and cloud image are 
decomposed and reconstructed, and their residual energy (not normalization) are shown in Figure 9. 
For the twenty image blocks, the residual energy of AMCD is the minimum, and that of GD is the 
maximum. This figure indicates that structural dictionary AMCD, which is trained according to image 
content, could more effectively describe the morphological component than these non-structural 
dictionaries GD and GMD, and its sparse representation ability is the most powerful. 




(a) 



(b) 



(c) 



Sensors 2014, 14 



9462 



Figure 7. Adaptive over-complete dictionary for cloud image, (a) Dictionary, (b) Target 
atom, (c) Background atom. 




(a) 

The discriminative over-complete dictionaries for space image, cloud image and multi-target image 
are shown in Figure 10. Figure 10a and 10b are the target over-complete dictionary and background 
over-complete dictionary for space image, respectively; Figure 10c and lOd are the target 
over-complete dictionary and background over-complete dictionary for cloud image, respectively; 
Figure lOe and lOf are the target over-complete dictionary and background over-complete dictionary 
for multi-target images, respectively. The threshold 5 used to distinguish target over-complete 
dictionary and background over-complete dictionary is equal to three multiplied by size of the atom, 
and the parameter k is equal to five. The target atoms are bright points, which are located at various 
positions of the image blocks with diverse shapes. The background atoms are noise and cloud clutter 
for space image, cloud image, and multi-target image respectively. There are 250 target atoms and 894 
background atoms for deep space image, 280 target atoms and 884 background atoms for cloud image, 
and 526 target atom and 498 background atoms for multi-target image. For the space image, three 
background atoms and two target atoms are wrongly identified as target atom and background atoms, 
respectively; for cloud image, there are 22 background atoms and nine target atoms are wrongly 
classified as target atom and background atom, respectively; for multi-target image, there are seven 
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background atoms and six target atoms are wrongly classified as target atom and background atom, 
respectively. The correct probabilities for space image, cloud image and multi-target image are more 
than 98%, 91%, and 95%, respectively, yet, depending on the complex degree of background clutter, 
the detection probability by the criteria based on Gaussian over-complete dictionary fluctuates. 

Figure 9. Residual energy after image reconstruction, (a) and (b) are target blocks in space 
image and cloud image, respectively, (c) and (d) are background blocks in space image and 
cloud image, respectively. 




(c) (d) 

Figure 10. Discriminative over-complete dictionary, (a) and (b) are target dictionary and 
background dictionary for space image, respectively; (c) and (d) are target dictionary and 
background dictionary for cloud image, respectively; (e) and (f) are target dictionary and 
background dictionary for multi-target image, respectively. 
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Figure 10. Cont. 




The representation coefficients of the target image blocks decomposed on this discriminative 
over-complete dictionary are shown in Figures 11-13. In Figures 11-13, Figures lla-13a are target 
signals, and Figures llb-13b and Figures llc-13c are the coefficients on corresponding background 
over-complete dictionary and corresponding target dictionary, respectively. There are many nonzero 
coefficients in Figures llb-13b of Figures 11, 12 and 13, and this means that the signal couldn't be 
sparsely represented by background over-complete dictionary. In Figures 11-13, Figures llc-13c have 
less nonzero coefficients than that of Figures llb-13b, and it indicates the target image blocks could 
be sparsely represented by target over-complete dictionary. Moreover, the residual reconstructed by 
background over-complete dictionary is much than that of target over-complete dictionary with the 
same m atoms, Here, m is equal to five. The target signal could be reconstructed by five target atoms 
with maximum nonzero coefficients in the corresponding target over-complete dictionary, and the 
residuals of deep space image, cloud image and multi-target image are very small, about 0.13, 0.17 and 
0.09 (normalization), respectively. Otherwise, their residuals after reconstructed using five background 
atoms with maximum nonzero coefficients in corresponding background over-complete dictionary are 
very big, and they are 0.94, 0.91 and 0.89, respectively. The residuals reconstructed by corresponding 
target over-complete dictionary Vtij) are less than that constructed by corresponding background over- 
complete dictionary rylj), and the image would be correctly labeled as target. 
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Figure 11, Representation coefficient of target signal in deep space image decomposed on 
discriminative over-complete dictionary, (a) Target signal, (b) Sparse coefficient on 
background dictionary, (c) Sparse coefficient on target dictionary. 
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Figure 12. Representation coefficient of target signal in cloud image decomposed on 
discriminative over-complete dictionary, (a) Target signal, (b) Sparse coefficient on 
background dictionary, (c) Sparse coefficient on target dictionary. 
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The representation coefficient of the background image blocks decomposed based on this 
discriminative over-complete dictionary are shown in Figures 14-16. In Figures 14-16, 
Figures 14a-16a are background noise, and Figures 14b-16b and Figures 14c-16c are the 
representation coefficients based on the corresponding background over-complete dictionary and 
corresponding target over-complete dictionary, respectively. In Figures 14, 15 and 16, There are some 
nonzero coefficients on atoms in Figures 14b-16b and Figures 14c-16c, and Figure 14b-16b has less 
nonzero coefficients than that of Figures 14c-16c, and it indicates the background image blocks could 
be represented based on the corresponding background over-complete dictionary more sparsely than 
based on the corresponding target over-complete dictionary. Moreover, the residual reconstructed by 
the background over-complete dictionary is less than that of the target over-complete dictionary with 
the same atoms. The background image could be reconstructed by five background atoms with 
maximum nonzero coefficients in the corresponding background over-complete dictionary, and the 
residuals of deep space image, cloud image and multi-target image are about 0.17, 0.28 and 0.10 
(normalization), respectively. The residuals reconstructed using five target atoms with maximum 
nonzero coefficients in corresponding target over-complete dictionary is 0.64, 0.83 and 0.75, 
respectively. Their residuals are r,(/) > rt(f), and the image should be labeled as background. 
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Figure 13. Representation coefficient of target signal in multi-target image decomposed on 
discriminative over-complete dictionary, (a) Target signal, (b) Sparse coefficient on 
background dictionary, (c) Sparse coefficient on target dictionary. 
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Figure 14. Representation coefficient of background block in deep space image 
decomposed on discriminative over-complete dictionary, (a) Sparse coefficient on 
background dictionary, (b) Sparse coefficient on target dictionary. 
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Figure 15. Representation coefficient of background block decomposed in cloud image on 
discriminative over-complete dictionary, (a) Sparse coefficient on background dictionary, 
(b) Sparse coefficient on target dictionary. 
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Figure 16. Representation coefficient of background block in multi-target image 
decomposed on discriminative over-complete dictionary, (a) Sparse coefficient on 
background dictionary, (b) Sparse coefficient on target dictionary. 



(a) 




0 50 1D0 150 200 250 30D 350 400 450 

atom d 



10D 150 200 250 30D 350 400 450 500 

atom d 



(b) 



(c) 



The receiver operating characteristic (ROC) curves of the Gaussian over-complete dictionary, 
discriminative over-complete dictionary and wavelet algorithm are shown in Figure 17. The ROC 
curve describes the probability of detection (PD) as a function of the probability of false alarms (PF). 
The PF is calculated by the number of false alarms (background pixels determined as target) over the 
number of background samples, and the PD is the ratio of the number of hits (target pixels determined 
as target) and the total number of true target samples. The extent parameters a] and a] in the two 
dimensional Gaussian model are extended to some degree, background clutter, even flat background, 
could be represented sparsely by the Gaussian over-complete dictionary. Hence, PD and PF would be 
increased with the extent parameters increasing. From the ROC plots, the discriminative over-complete 
dictionary outperforms the Gaussian over-complete dictionary, i.e., the PD of the former is larger than 
that of the latter when their PFs are same. Meanwhile, the wavelet algorithm is the worst one among 
the three algorithms for the example images. 

Figure 17. ROC curves of target detection. 
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6. Conclusions 



This paper proposed an infrared dim target detection approach based on a sparse representation on a 
discriminative over-complete dictionary. This non- structural over-dictionary adaptively learns the 
content of infrared images online and is further divided into a target over-complete dictionary and a 
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background over-complete dictionary automatically. The target over-complete dictionary could 
describe target signals, and the background over-complete dictionary would represent background 
clutter. This discriminative over-complete dictionary not only can capture significant features of 
background clutter and dim targets better than a structural over-complete dictionary, but also can 
efficiently strengthen the sparse feature difference between background clutter and target signals better 
than a discriminative over-complete dictionary learned offline and classified manually. The 
experimental results show that this proposed approach could effectively improve the performance of 
small target detection. 

When SNR is lower than one, a target signal would be submerged in the strong noise, and its shape 
would be polluted and be not represented simply by a Gaussian model. Future work would focus on 
how to more effectively distinguish the target over-complete dictionary and background over-complete 
dictionary from adaptive morphological over-complete dictionary for diverse target signals and 
background clutter even in low SNR. Moreover, the computation time of the discriminative 
over-complete dictionary to detect target signal is about 6.4 s for a frame image, and this proposed 
algorithm should be optimized to decrease the computation complexity. 
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